Poorly-conserved ORFs in the genome of the archaeon Halobacterium sp. NRC-1 correspond to expressed proteins
نویسندگان
چکیده
Motivation: A large fraction of Open Reading Frames (ORFs) identified as “hypothetical” proteins correspond to either “conserved hypothetical” proteins, representing sequences homologous to ORFs of unknown function from other organisms or to hypothetical proteins lacking any significant sequence similarity to other ORFs in the databases. Elucidating the functions and 3D structures of such orphan ORFs, termed ORFans or PCOs (poorly conserved ORFs), is essential for understanding biodiversity. However, it has been claimed that many ORFans may not encode for expressed proteins. Results: A genome-wide experimental study of “paralogous PCOs” in the halophilic archaea Halobacterium sp. NRC-1 was conducted. Paralogous PCOs are ORFs with at least one homolog in the same organism, but with no clear homologs in other organisms. The results reveal that mRNA is synthesized for a majority of the Halobacterium sp. NRC-1 paralogous PCO families, including those composed of relatively short proteins, strongly suggesting that these Halobacterium sp. NRC-1 paralogous PCOs correspond to true, expressed proteins. Hence, further computational and experimental studies aimed at characterizing PCOs in this and other organisms are merited. Such efforts could shed light on PCOs’ functions and origins, thereby serving to elucidate the vast diversity observed in the genetic material. Contact: J. Eichler, e-mail: [email protected]; D. Fischer, e-mail: [email protected]; B. Shaanan, e-mail: [email protected]
منابع مشابه
Poorly conserved ORFs in the genome of the archaea Halobacterium sp. NRC-1 correspond to expressed proteins
MOTIVATION A large fraction of open reading frames (ORFs) identified as 'hypothetical' proteins correspond to either 'conserved hypothetical' proteins, representing sequences homologous to ORFs of unknown function from other organisms, or to hypothetical proteins lacking any significant sequence similarity to other ORFs in the databases. Elucidating the functions and three-dimensional structure...
متن کاملThe cobY gene of the archaeon Halobacterium sp. strain NRC-1 is required for de novo cobamide synthesis.
Genetic and nutritional analyses of mutants of the extremely halophilic archaeon Halobacterium sp. strain NRC-1 showed that open reading frame (ORF) Vng1581C encodes a protein with nucleoside triphosphate:adenosylcobinamide-phosphate nucleotidyltransferase enzyme activity. This activity was previously associated with the cobY gene of the methanogenic archaeon Methanobacterium thermoautotrophicu...
متن کاملTranscriptional profiling of the model Archaeon Halobacterium sp. NRC-1: responses to changes in salinity and temperature
BACKGROUND The model halophile Halobacterium sp. NRC-1 was among the first Archaea to be completely sequenced and many post-genomic tools, including whole genome DNA microarrays are now being applied to its analysis. This extremophile displays tolerance to multiple stresses, including high salinity, extreme (non-mesophilic) temperatures, lack of oxygen, and ultraviolet and ionizing radiation. ...
متن کاملProteomic analysis of an extreme halophilic archaeon, Halobacterium sp. NRC-1.
Halobacterium sp. NRC-1 insoluble membrane and soluble cytoplasmic proteins were isolated by ultracentrifugation of whole cell lysate. Using an ion trap mass spectrometer equipped with a C18 trap electrospray ionization emitter/micro-liquid chromatography column, a number of trypsin-generated peptide tags from 426 unique proteins were identified. This represents approximately one-fifth of the t...
متن کاملMutS and MutL Are Dispensable for Maintenance of the Genomic Mutation Rate in the Halophilic Archaeon Halobacterium salinarum NRC-1
BACKGROUND The genome of the halophilic archaeon Halobacterium salinarum NRC-1 encodes for homologs of MutS and MutL, which are key proteins of a DNA mismatch repair pathway conserved in Bacteria and Eukarya. Mismatch repair is essential for retaining the fidelity of genetic information and defects in this pathway result in the deleterious accumulation of mutations and in hereditary diseases in...
متن کامل